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Abstract — Cognitive radios have been studied recently as a means 
to utilize spectrum in a more efficient manner. This paper focuses 
on the fundamental limits of operation of a MIMO cognitive radio 
network with a single licensed user and a single cognitive user. 
The channel setting is equivalent to an interference channel with 
degraded message sets (with the cognitive user having access to 
the licensed user's message). An achievable region and an outer 
bound is derived for such a network setting. It is shown that 
under certain conditions, the achievable region is optimal for a 
portion of the capacity region that includes sum capacity. 



I. Introduction 

The design of radios to be "cognitive" has been identified 
by the Federal Communications Commission (FCC) as the 
next big step in better radio resource utilization [2]. The term 
"cognitive" has many different connotations both in analysis 
and in practice, but with two underlying common themes: 
intelligence built into the radio architecture coupled with 
adaptivity. 

Cognitive radios have been studied under different model 
settings. The first models studied cognitive radios as a spec- 
trum sensing problem [3] [4] [5] [6]. Under this setting, the 
cognitive radio opportunistically uses licensed spectrum when 
the licensed users are sensed to be absent in that band. 
Problems encountered in this setup are threefold : 

1) Sensing must be highly accurate to guarantee non inter- 
ference with the licensed radio. 

2) Control and coordination between the cognitive trans- 
mitter receiver pair is required to ensure the same 
spectrum is used, and finally 

3) There are no QoS guarantees for the cognitive transmit- 
ter receiver pair. 

Other models with different side information at the cognitive 
users have been studied. In [7] and [8], the authors study 
frequency coding by the cognitive transmitter by assuming 
non causal knowledge of the frequency use of the primary 
transmitter. 

Manuscript received May 18, 2007; revised September 16, 2007; revised 
October 26, 2007. This research was supported in part by National Sci- 
ence Foundation grants NSF CCF-0448181, NSF CCF-0552741, NSF CNS- 
0615061, and NSF CNS-0626903, THECB ARP and the Army Research 
Office YIR The material in this paper was presented in part at the IEEE 
Information Theory Workshop, Lake Tahoe, CA, September 2007 [1]. 

The authors are with the Wireless Networking and Communications 
Group, Department of Electrical and Computer Engineering, University of 
Texas at Austin, Austin, TX - 78712 (email: sridhara@ece.utexas.edu; sri- 
ram @ ece. utexas . edu) . 



In this paper, we study cognition from an information theoretic 
setting where we assume that the cognitive transmitter knows 
the message of the licensed transmitter apriori. Such a model 
is interesting for two reasons : 1) It provides an upper limit, 
or equivalently a benchmark on the performance of systems 
where the cognitive radio gains a partial understanding of the 
licensed transmitter and 2) It allows us to understand the ulti- 
mate limits on the cognitive transmitter by giving it maximum 
information and allowing it to change its transmission and 
coding strategy based on all the information available at the 
licensed user. In essence, it enlarges the possible schemes that 
can be implemented at the cognitive radio, and 3) It lends itself 
to information theoretic analysis, being a setting where such 
tools can be applied to determine the performance limits of the 
system. Many other configurations, including the interference 
channel setting when the cognitive transmitter does not know 
the message of the licensed transmitter are multi-decade long 
open problems. 

The goal of this paper is to study the fundamental limits 
of performance of cognitive radios. Along the lines of [9], 
we consider the model depicted in Figure 1. In this setting, 
we have an interference channel [ 10] [ 1 1][12][13], but with 
degraded message sets, where the transmitter with a single 
message is called "legacy," "primary" or "dumb" and the trans- 
mitter with both messages termed the "cognitive" transmitter. 
Prior work on this model for the single antenna case is in 
[9][14][15][16], 

In this paper, we study the performance of the cognitive 
radio model under a multiple antenna (MIMO) setting. Both 
the licensed and cognitive transmitter and receiver may have 
multiple antennas. MIMO is fast becoming the most common 
feature of wireless systems due to its performance benefits. 
Thus, it is important to study the capacity of cognitive radios 
under a MIMO setting. There are some instances where 
the methods used in this paper bears similarities with the 
methods used for the SISO setting. However, most of the 
proofs and techniques used here are distinct and considerably 
more involved than those used in [16]. In the SISO setting, 
it is possible to analyze the model for specific magnitudes of 
channels. This is not possible for the MIMO setting. We list 
some of the crucial differences between the methods used in 
this paper and the methods that have been used under the SISO 
setting. 

1) In [16], the authors obtain the outer bound using 
conditional entropy inequality. This method cannot be 
extended to the MIMO setting. 

2) We obtain the outer bound through a series of channel 
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transformations. Although the channel transformations 
are similar in spirit to those in [15], the actual transfor- 
mations used are significantly different both in nature 
and in the mathematical proofs that accompany them. 
In [15], the authors reduce the channel to a broadcast 
channel where the combined transmitters have individual 
power constraints and the cognitive receiver has the 
message of the licensed user provided to it by a genie. 
The capacity region for such a variation of broadcast 
channel is not known in general. The authors solve 
for the capacity region of the broadcast channel using 
aligned channel techniques. On the other hand, we re- 
duce the MIMO cognitive channel to a broadcast channel 
with sum power constraint and whose capacity region 
is now known [ 17] [ 1 8] [ 19]. We then use optimization 
techniques to compare the achievable scheme with the 
outer bound. 



A. Main Contributions 

In this paper, our main contributions include: 

1 . We find an achievable region for the Gaussian MIMO cog- 
nitive channel (MCC) in a fashion analogous to [9] [15] [16]. 

2. We find an outer bound on the capacity region of the MCC. 

3. We show that, under certain conditions (that depend on the 
channel parameters), the outer bound is tight for a portion of 
the capacity region boundary, including points corresponding 
to the sum-capacity of the channel. Combining the two above, 
we characterize the sum capacity of this channel and a portion 
of its entire capacity region under certain conditions. 



B. Organization 

The rest of the paper is organized as follows. We describe the 
notations and system model in Section [II] The main results 
are presented in Section [III] In Section IIV1 we present an 
achievable region for the Gaussian MIMO cognitive channel 
(MCC). An outer bound on the capacity region is shown 
in Section [V] The optimality of the achievable region for a 
portion of the capacity region (under certain conditions) is 
shown in Section[Vl] Numerical results are provided in Section 
[Vll] We conclude in Section [VlTTl 



II. System Model and Notation 

Throughout the paper, we use boldface letters to denote 
vectors and matrices. |A| denotes the determinant of matrix 
A, while Tr(A) denotes its trace. For any general matrix 
or vector X, denotes its conjugate transpose. I n de- 
notes the n x n identity matrix. X™ denotes the row vector 
(X(l),X(2),...,X(n)), where X(i),i = 1, 2, . . . ,n can be 
vectors or scalars. The notation H >z is used to denote that 
a square matrix H is positive semidefinite. Finally, if S is a 



set, then C1(S) and Co(S) denote the closure and convex hull 
of S respectively. 

We consider a MIMO cognitive channel shown in Figure 1. 
Let n Pt t and n p>r denote the number of transmitter and receiver 
antennas respectively for the licensed user. Similarly, n c ,t an d 
n c , r denotes the number of transmitter and receiver antennas 
for the cognitive user. 
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Licensed Receiver 
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Fig. 1. MIMO Cognitive Radio System Model 

The licensed user has message m p <G {1, 2, . . . , 2 nRp } in- 
tended for the licensed receiver. The cognitive user has mes- 
sage m c 6 {1,2,..., 2 nRc } intended for the cognitive receiver 
as well as the message m p of the licensed user. 

The primary user encodes the message m p into X p ™. Here, 
X p (i) is a n Pt t length complex vector. The cognitive trans- 
mitter determines its codeword X c ™ as a function of both 
nip and m c . Note that the cognitive transmitter wishes to 
communicate both m p (to the licensed receiver) and m c (to 
the cognitive receiver). The channel gain matrices are given 
Ip p, Hp c , H c p and H c c , and are assumed to be static. 
It is assumed that the licensed receiver knows H pp ,H cp , 



by H 

It is ; 

the licensed transmitter knows H p p . It is also assumed that 
the cognitive transmitter knows H cp ,H pc ,H cc and the 
cognitive receiver knows H p c , H c c . The received vectors of 
the licensed and cognitive users are denoted by Y p ™ and Y c ™ 
respectively. 

With the above model and notations, we can describe the 
system at time slot i by 



Y p (i) = Hp. p Xp(i) + H CjP X c (i) + Zp(i) 
Yc(i) = Hp. c Xp(i) + H c , c X c (i) + Z c (i). 



(1) 



The additive noise at the primary and secondary receivers is 
denoted by Z p " and Z c " respectively. The noise vectors Z p ™ 
and Z c " are Gaussian and are assumed to be i.i.d. across 
symbol times and distributed according to Af(0,I n r ) and 
Af(0, I n „) respectively. The correlation between Z p " and 
Z c " is assumed to be arbitrary. This correlation does not 
impact the capacity region of the system as the licensed and 
the cognitive decoders do not co-operate with each other. Q 

We denote the covariance of the codewords of the licensed 
and cognitive transmitters at time i by E p (i) and S c (i) 

'A proof of this can be obtained using steps almost exactly identical to 
those for the broadcast channel in [20, Exercise 15.10] 



3 



respectively. Then, the transmitters are constrained by the 
following transmit power constraints. 



Er=i Tr ( s p(*)) < np p 

£r =1 Tr(53 c (i))<nP c . 
A rate pair (R p , R c ) is said to be achievable if 



(2) and K 



1) there exists a sequence of encoding functions for the 
licensed and cognitive users E p : {1, . . . , 2 nRp } — > 
X p " and E™ :{!,..., 2 nR "} x {1, . . . , 2 nR -=} -> X c ™ 
such that the codewords satisfy the power constraints 
given by (|2}, 

2) there exists decoding rules D" : Y p n — > {1, . . . , 2 nR p} 
and D™ : Y c ™ -> {1, . . . , 2 nR -} such that the average 
probability of decoding error is arbitrarily small for 
suitably large values of n. 

The capacity region of the Gaussian MIMO cognitive channel 
is the set of all achievable rate pairs (R p , R c ) and is denoted 
by Cmcc- 



Let 53. be a covariance 



We now describe an outer bound on the capacity region of the 
MIMO cognitive channel. Let a > 0, G„ = H r 
Hp, p H C)P /ya 
H c , c /V^ _ 
matrix of dimensions (n p>r + n c>r ) x (n PtT + n c , r ) and of 
the form 

" K , Q 
O f I 



(6) 



Here, Q z is a n PiT . x n c , r matrix that makes 53 z positive 
semidefinite. Let TZ"^ denote the set described by 



(R p , R c ), Q p , Q t 

R P >0,R C > 0, Q p h 0, Q c h 

i? P < log|l + G Q Q p G^ + G Q Q c Gt Q | i 
-log|l + G Q Q c Gt Q | 



i? c < log 



S z + KQ c Kt 



log|S z 



Tr(Qp) + Tr(Q c ) < P p + aP c 



III. Main Results 

In this section, we describe the main results of the paper. Let 
G = [Hp.p H c p ]. Let IZach denote the set described by 



(Rp, Rc), Sp, S CjP , S c .c, Q 



R P >0,R C > 0, S p h 0, E c ,p h 0, S c , c h 

Rp < log |I + GSp. n etGt + Hc.pSc.cHt p | 

-log|l + H CjP S c>c Ht )P | 
R c < log|l + H c , c S c , c Ht c | 
S P Q 

Q f S c . p 

Tr(Sp) < P p , Tr(S CiP + S c , c ) < P c 



J p,net 



>- o, 



} ■ 0) 



In this setting, S p . ne t is a (n Pit + n c ,t) X (n p ,t + n C)t ) 
covariance matrix while S c c is a n c t x n c t covariance matrix. 
S p and S c ,p represent principal submatrices of Sp, n et of 
dimensions n Pi t x n Pi t and n c t x n Cj t respectively. The 
covariances matrices S p , S c p and S c c determine the power 
constraints of the system. 

Let TZi n denote the closure of the convex hull of the set of 
rate pairs described by 

(R P ,R C ) : 3 Sp,S c . p ,S c , c ,Q, and 

(4) 

ich 



(R p , Rc), 5j p , S c p , S c c , Q ) £ 7l„ 



Theorem 3.1: The capacity region of the MCC, Cmcc satis- 
fies 

TZin Q Cmcc- (5) 

The proof of the theorem is given in Section |IV] The coding 
strategy is based on Costa's dirty paper coding [21][22]. 



Let lZg Ut z denote the closure of the convex hull of the set of 
rate pairs described by 

{ (R p , R c ) : 3Q P , Q c h 0, ((R P ,R C ), Q P , Q c ) € } . 

(8) 

Also, let TZout be represented as 

n out = f) f) n a out . (9) 

S = Q>0 

Then, the next theorem describes an outer bound on the 
capacity region of the MCC. 

Theorem 3.2: The capacity region of the MCC, Cmcc satis- 
fies 



Cmcc Q ft^-, Va >0, 53. 
Cmcc Q TZout- 



(10) 



The proof is given in Section [V] and proceeds by a series of 
channel transformations. Each channel transformation results 
in a new channel whose capacity region is in general a superset 
(outer bound) of the capacity region of the preceding channel. 

Let BC(Hi,H 2 ,P) denote a two user MIMO broadcast 
channel with channel matrices given by Hi and H2 and with 
a transmitter power constraint of P. Let C^ H2 ' F denote the 
capacity region of BC(Hi, H2, P). 

Let TZp art conv denote the set described by 



(R P , R c ), Q P - S c 



R P >0,R C > 0,Q P h 0,S C , C h 0, 
R p < log |I + G Q Q p Gt + iH c . p S c , c E 
-log|l+iH CiP 53 CiC Ht jp | 

R c <log|l+iH Cl c53 C)C Ht )C | 
Tr(Qp) + Tr(E C:C ) < P p + aP c 



t 

c,p I 



(11) 
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We let TZp art out to denote the closure of the convex hull of 



the set of rate pairs described by 

(Rp,R c ) : 3Q p ,E c , c h and 

((Rp, Rc)> Qpj -^c,c) € T^-part,conv 



(12) 



Let K = [0 Hcc/V^]- We show that if the boundary of the 
rate region described by lZp art out partially meets the bound- 
ary of the capacity region of BC(G a , K, P p + aP c ), then the 
boundary of 7Zp art out partially meets the boundary of the rate 
region described by in (0 for some E z . We formally 

state the result in Theorem 13.31 For notational convenience, 
we will denote the capacity region of BC(G a , K, P p + aP c ) 
by C a BC . 

Theorem 3.3: Let /i > 1 and a > 0. If 

max uR„ + R c = max uR p + R c , (13) 

(Rp,Rc)eK% artiOUt (R p ,R c )ec* c 

then, we have 

max uR„ + R c = inf max u,R v + R c . 

(14) 

The proof of the theorem is described in Section [V] Hence, 
if the condition (fT3l is satisfied, the rate region described by 
lZp art out is an outer bound on the capacity region of the MCC 
in terms of maximizing the /i- sum fiR p + R c . 

Let (R p , R c ) be a point on the boundary of the capacity region 
Cmcc- Then, there exists a /i > such that 



(R p , R c 



arg max fiR p 

(R P ,Ra)eC MC c 



Rr 



The next theorem shows that if (R p , R c ) lies on the boundary 
of the achievable region given by 7Zi n , then (R p ,R c ) lies 
on the boundary of lZp art out for some a > 0. That is, the 
theorem describes conditions of optimality of the achievable 
region H ln . 

Theorem 3.4: For any /i > 0, 

max nR„ + R c = inf max nR„ + R c . 

(R p ,R c )GK in y a>0 (R p ,R c )eTl« art out 

Also, there exists a* <G (0,oo), such that for any \i > 1, 
(R Pt(1 , R C:P ) = argmax (/?p . i?c)eKin uR p + R c is a point on 
the boundary of the capacity region of the MIMO cognitive 
channel if the condition given by ( fT3l is satisfied for a* . 

The proof of the theorem is described in Section |VT] and is 
based on optimization techniques. 



IV. Achievable Region 



Proof of Theorem li.il : In this section, we show that the rate 
region lZi„ given by © is achievable on the MCC. 

Encoding rule for Licensed user (Ep ) : For every message 
nip £ {1, . . . , 2 nRp }, the licensed encoder generates a n length 
codeword X p "(m p ), according to the distribution p(X p ") = 



n? =1 p(X p (i)), and X p (i) 
Tr(S p ) < Pp. 



< Af(0, £ p ) such that E p X and 



Encoding rule for the cognitive user The cogni- 

tive encoder acts in two stages. For every message pair 
(m p ,m c ), the cognitive encoder first generates a codeword 
X c p n (mp, m c ) for the primary message m p according to 
iy= lP (X c , p (t)\X p (i)), where p(X CjP (i)) -7V(0,E C , P ) and 
the joint distribution of (X p (i), X c p (i)) is given by 



p(x p (i),x c ,p(i))-jv o, 



Q 



Q 



(15) 



Here, Q denotes the correlation between X p (i) and X c p (i). 
In the second stage, the cognitive encoder generates X c c ra 
which encodes message m c . The codeword X c c " is generated 
using Costa precoding [21] by treating H PjP X p n +H c . c X CiP n 
as non causally known interference. A characteristic feature 
of Costa's precoding is that X c c " is independent of X c p ", 
and X c c ™ is distributed as n" =1 p(X c c (i)), where X c c (i) ^ 
jV(0, E c c ). Note that the codeword X c p ™ is used to convey 
message m p to the licensed receiver and the codeword X c c ra 
is used to convey message m c to the cognitive receiver. The 
two codewords X c p " and X c c ™ are superimposed to form 
the cognitive codeword X c " = X c p ' 1 + X c c ™. It is clear 
that X c n is distributed as n™ =1 p(X c (i)), X c (/) - 7V(0, E c ), 
where E c = S c p + E c c . The covariance matrices satisfy the 
constraints E C!P X 0, E c ^ c X 0, Tr(E c ) < P c . 



receiver receives H p p X p 



Decoding rule for the licensed receiver (Dp) : The licensed 

H C , P (X C . P ™ + X C!C ") + Z p ". 
as the valid codeword 
Zn" as Gaussian noise. Taking G = 
X n 



It treats H p , p X p 
and H CjP X C!C " -+ 



aj an d X p ne t 

at the licensed receiver is 



X r 



, the received vector 



GX 



p,net 



H C; pX c c 



(16) 



The covariance matrix of X p ne t is denoted by E p net 



Q 



Q T S c , p 



, where Q 



£[X p Xj. ]. In this setup, we 

use steps identical to that used for MIMO channel with colored 
noise in [20, Section 9.5] to show that, for any e > 0, there 
exists a block length n\ so that for any n > n\, the licensed 
decoder can recover the message m p with probability of error 
< e if 



R P < log |I 



GE pne tG^ 



log I 



H UI- 



= Ht c,pl 



(17) 



Decoding rule for the cognitive user (-D™) : The cognitive de- 
coder is the Costa decoder (with the knowledge of the encoder, 
E™). The cognitive receiver receives Y c ™ = H p c X p " + 
H c .c(X c . p " + X c c ") + Z c n . Here, the non-causally known 
interference H p c X p ™ + H C C X C p " is canceled by the Costa 
precoder. To show this formally, we follow steps similar to 
Eqns (3) to (7) in [21]. We get that, for any e 2 > 0, there 
exists 71,2 such that for n > ?i2, the cognitive decoder can 
recover the message m c with probability of error < 62 if 

i? c <log|l + H CiC E c ,Hl c |- (18) 
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Note that the achievable scheme holds for all possible covari- 
ance matrices S p ,X cp ,X cc that are positive semidefinite 
and satisfy the power constraints Tr(£ p ) < P p ,Tr(S c p + 
S c .c) < Pc- Hence, lZi n , which is the set of all achievable 
rate pairs described by ®, is achievable for any code length 
n > max(ni, 712). ■ 



V. Outer Bound on the Capacity Region 

In this section, we prove that the rate region described by 
TZ-o^* is an outer bound on the capacity region of the 
Gaussian MIMO cognitive channel. The proof proceeds by 
a series of channel transformations where each transformation 
creates an outer bound on the channel at the previous stage. 
At the final stage, we obtain a physically degraded broadcast 
channel. The capacity region of this channel is now known 
[17] [18] [19] and is used as the outer bound for the capacity 
region of the MIMO cognitive channel. Figure 2 depicts the 
various channel configurations considered, and the system 
equations of all the configurations. Z™ shown in Figures 2c, 2d 
and 2e has the same distribution as Z p ™, but has an arbitrary 
correlation with Z c ". Before proving Theorem 13.21 we prove 
the following lemmas. 

Transformation 1 (MIMO Cognitive Channel (MCC) — > 
Scaled MIMO cognitive channel) : The scaled MIMO cog- 
nitive channel is defined in Figure 2b and Figure 3. In 
this transformation, the channel matrices H c p and H c c are 
scaled by l/^/a. Also, the power constraint at the cognitive 
transmitter is changed to aP c . 



Lemma 5.1: The capacity region of the MIMO cognitive 
channel is equal to the capacity region of the scaled MIMO 
cognitive channel (SMCC) for any < a < 00. 

Proof : Let (R p ,R c ) be a rate pair that is achievable on 
the MCC. That is, for all 61,62 > 0, there exists a n and 
a sequence of encoder decoder pairs at the licensed and 
cognitive transmitter and receiver (E p : m p — > X p ™,Dp : 
Y p " -> m p ,E% : (m p ,m c ) -> X c ", £™ : Y c " -> m c ) 
such that the codewords X p ™ and X c ™ satisfy the power 
constraints given by (f2]i and the probability of decoding error 
is small (Pr(m p ^ rh p ) < e\,Pr(m c 7^ m c ) < 62). 
We use the following encoder decoder pairs at the licensed 
and cognitive transmitters and receivers of the scaled MIMO 
cognitive channel. : m p -> X p n , D 7 p l : Y p n ->■ m p , E™ : 
(mp, m c ) — ► v'ttXc", D™ : Y c " — > m c . It follows that using 
these encoder and decoder pairs, the licensed and cognitive 
codewords satisfy the new power constraints of P p and aP c 
respectively. Also, the system equation is the same as that of 
the MCC and Pr(m p 7^ rh p ) < t\ and Pr(m c 7^ m c ) < 62. 
Hence, the rate pair (R p ,R c ) is achievable on the scaled 
MIMO cognitive channel. Hence, the capacity region of the 
SMCC is a superset of the capacity region of the MCC. 

Similarly, we can also establish this in the other direction, 
namely we can treat the MCC as the scaled version of the 
SMCC (scaling by 1/a). Therefore, it can be shown that the 



capacity region of the MCC is a superset of the capacity region 
of the SMCC. 

Hence, the capacity region of the MCC is equal to the capacity 
region of the SMCC. ■ 

Transformation 2 (scaled MIMO cognitive channel (SMCC) 
— > scaled MIMO cognitive channel A (SMCCA)) : The scaled 
MIMO cognitive channel A (SMCCA) is described in Figure 
2c and Figure 4. In this transformation, we provide a modified 
version of Y p ", which is Y™ to the cognitive receiver. Y^ 
is corrupted by noise Z p , which has the same probability 
distribution as that of Z p ' 1 (i.e., complex Gaussian with zero 
mean and identity covariance matrix), but is permitted to be 
correlated with Z p n or Z c n . In fact, we assume that the joint 
probability distribution of (Z p (i), Z c (i)) is given by 

p(fcp(i),Z c (*))=JV(0,£ a ), (19) 

where S z has the form given by (|6). The received vector Y p 
is made available to the cognitive receiver by transforming the 



channel matrices H p c and H c c 



a to K\ 



Hp 
H r 



and 



Ko = 



H c 



respectively. Hence, the received vector 



at the cognitive receiver is 



Lemma 5.2: The capacity region of the scaled MIMO cogni- 
tive channel A (SMCCA) is a superset of the capacity region 
of the scaled MIMO cognitive channel (SMCC). 

Proof : Let the rate pair (R p , R c ) be achievable on the 
SMCC. That is, for all 61,62 > 0, there exists a n and a 
sequence of encoder decoder pairs at the licensed and cognitive 
transmitter and receiver (E™ : m p — ► X p ",£)" : Y p ™ — > 
m p ,E™ : (m p ,m c ) -> X c ™, £>™ : Y c " -»■ m c ) such that 
the codewords X p " and X c ™ satisfy the power constraints 
and the probability of decoding error is small (Pr(m p 7^ 
m p ) < t\,Pr(m c 7^ m c ) < 62). In the SMCCA, we can 
use the same encoder decoder pair E' p and D" at the licensed 
transmitter and receiver to achieve a rate R p with probability 
of decoding error < e\. Also, by ignoring the received vector 
Y p at the cognitive receiver, we can use E™ and D™ at the 
cognitive transmitters and receivers to achieve a rate R c with 
the decoding probability of error < 62. Hence, the rate pair 
(R p , R c ) is achievable on the scaled MIMO cognitive channel 
A (SMCCA). Therefore, the capacity region of the SMCCA 
is a superset of the capacity region of the SMCC. ■ 

Transformation 3 (scaled MIMO cognitive channel A (SM- 
CCA) -> scaled MIMO cognitive channel B (SMCCB) ) : The 
scaled MIMO cognitive channel (B) is described in Figure 2d 
and Figure 5. The channel matrix from the licensed transmitter 



to the cognitive receiver is modified from Ki 



H 
H 



H 



pp 




pp 

p,c 



to 



receiver is given by 



Hence, the received vector at the cognitive 
Y' 1 



where Y r n 



H c c 



x r 



The intuition behind the transformation is to remove the 
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Power Constraint %!p Power Constraint ^P Power Constraint 




Figure 2c Scaled MIMO Cognitive Channel A (SMCCA1 




Figure 2d Scaled MIMO Cognitive Channel B (SMCCB) Figure 2e Scaled MIMO Broadcast Channel A (SMBCA) Figure 2f Scaled MIMO Broadcast Channel (SMBC) 



Licensed User : Y p = H jLp X p + H, :p X, + Z p 
Cognitive User : Y r = H p , r X p + H V X C + Z, : 



Licensed User : Y p = H p , p X p + (H c , p /^/a)X c + Z T 
Cognitive User : Y r = H p , c X p + (B v /y/a)X e + Z r 



Licensed User : Y f = H rf X r + {H v /^/a)X c + Z p 

Cognitive User : Y p = H pp X p + (H e J s/a)X,. + Z p 
Y r = H pr X p + (H V I^/S)X, + Z, 



MIMO Cognitive Channel (MCC) 



Scaled MIMO Cognitive Channel (SMCC) 



Scaled MIMO Cognitive Channel A (SMCCA) 



Licensed User : Y p = H PJ X P + (H,, p /^a)X r + Z p 
Cognitive User : % = H pj ,X p + {H, :J J s/aX,. + Z p 
Y,.= m,,,.l^)X,+ Z,. 

Scaled MIMO Cognitive Channel B (SMCCB) 



Licensed User : Y p = [ H p , p H^/^/a] X + Z p 
Cognitive User : Y r = [ H PJ , H,, p /s/a] X + Z p 
Y e =[0 H v /s/a]X + Z e 
Scaled MIMO Broadcast Channel A (SMBCA) 



Licensed User : Y p = [ H pp H cf /^a ] X + Z, 
Cognitive User : Y r = [ iT^/v/a] X + Z r 

Scaled MIMO Broadcast Channel (SMBC) 



Fig. 2. Channel Configurations and their System Equations 
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original interference caused by the licensed transmitter to the 
cognitive receiver. 

Lemma 5.3: The capacity region of the scaled MIMO cogni- 



tive channel B (SMCCB) is equal to the capacity region of 
the scaled MIMO cognitive channel A (SMCCA). 

Proof : Let the rate pair (R p ,R c ) be achievable on the 
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SMCCA. This implies that for every ex,e 2 > 0, there exists 
encoder-decoder pair for the licensed user (E p (ei), D p (ei)) 
and for the cognitive user (E™(e2), D"(e2)) such that the 
probability of decoding error is less than ei and e 2 respectively 
for the licensed and cognitive user. Let 81,62 G (0,1). In 
SMCCB, the licensed user can employ E p (min((5i/2, S 2 /2)), 
D™(min(^i/2, 62/2)) to decode m p with a probability 
of error < 81 /2 < Si. The cognitive receiver uses 
££(min(<Ji/2,<f 2 /2)), D; l (min(<5i/2, S 2 /2)) on Y™ to obtain 
m p with probability of error < 81/ 2. The cognitive receiver 
can now construct X p ™ and hence H p c X p ". Thus, the 



cognitive receiver recovers Y c " = H pc X p r 



H 



Z c n . Now, it uses, E^(S 2 /2),D^(S 2 /2) to obtain m c with 
probability of error < 82/2. Clearly, the probability of error 
in recovering m c is less than 82- Hence, the rate pair (R p , R c ) 
is achievable on SMCCB. Therefore, the capacity region of 
SMCCB is a superset of the capacity region of SMCCA. 

Let the rate pair (R p ,R c ) be achievable on SMCCB. Then, 
for every ei,e 2 > 0, there exists encoder-decoder pair for 
the licensed user (E p (ei), D p (ei)) and for the cognitive user 
(-E"(e2), Z)"(e2)) such that the probability of decoding error is 
less than ei and £2 respectively for the licensed and cognitive 
user. Let <5i,(5 2 > 0. In SMCCA, the licensed user can em- 
ploy J B p "(min((5 1 /2, ( 5 2 /2)),^(min((5 1 /2, ( 5 2 /2)) to decode 
m p with a probability of error < <5i/2 < Si. The cognitive user 
employs E; l (mm(8i/2 1 S 2 /2)), D^(min(5 1 /2, S 2 /2)) on Y£ 
to obtain m p with probability of error < <5 2 /2. The cognitive 
receiver can now construct X p ™ and hence H p c X p ". Hence, 
the cognitive receiver subtracts H p c X p " from Y c ™ to obtain 
Y c . The cognitive receiver can now use E™ (82/ 2), D™ {82/ 2) 
to obtain m c with probability of error < <S 2 . Thus, the rate pair 
(i? p , R c ) is achievable on SMCCA. 

Therefore, the capacity region of the SMCCA is equal to the 
capacity region of the SMCCB. ■ 

Transformation 4 (scaled MIMO cognitive channel (B) — > 
scaled MIMO broadcast channel A (SMBCA)): The scaled 
MIMO broadcast channel A (SMBCA) is depicted in Fig- 
ure 2e and Figure 6. We let the two transmitters to co- 
operate and transform it into a broadcast channel with a sum 
power constraint of P p + aP c . The new channel matrices 
from the combined transmitters to the licensed and cognitive 



receivers are given by G a 



H r 



and 



K 



H 



pp 




Hc. P /V" 

Hc.c/v^ 



respectively. 



Lemma 5.4: The capacity region of the scaled MIMO broad- 
cast channel A (SMBCA) is a superset of the capacity region 
of scaled MIMO cognitive channel B (SMCCB). 

Proof : Let the rate pair (R p ,R c ) be achievable on the 
SMCCB. In the SMBCA, using no collaboration between the 
two transmitters and using separate power constraints of P p 
and aP c respectively, we reduce the SMBCA to the SMCCB. 
Hence, the rate pair (R p ,R c ) is achievable on the SMBCA. 
Therefore, the capacity region of the SMBCA is a superset of 
the capacity region of the SMCCB. ■ 

We have showed that for any a > 0, Cmcc = Csmcc ^= 
Csmcca = Csmccb Q C S mbca- Hence, the capacity region 
of the scaled MIMO broadcast channel A (SMBCA) is a 
superset of the capacity region of the MIMO cognitive channel 
(MCC). 



Proof of Theorem \3J\ : In the SMBCA, let Q p denote the 
covariance matrix of the codeword for the licensed user and 
let Q c denote the covariance matrix for the cognitive user. The 
SMBCA is a physically degraded broadcast channel. Hence, 
the capacity region of the SMBCA (as given by [17]) denoted 
by Cs mbca is the closure of the convex hull of the set of rate 
pairs described by 

(R P ,R C ) : R p > 0,R C > 

R p < log |I + G Q Q p Gt Q + G a Q c Gl\ 
-logll + G^QeG^ 



R c < log 



log|S 2 



(20) 



S z + KQ c Kt 

VQ P h 0, Q c X 
^ Tr(Q p ) + Tr(Q c ) < P p + aP ( 



Also, this is the outer bound of the MCC. Hence, Ti-out * 
described by ((HJ) is an outer bound on the capacity region 



of the MCC. Hence, C 



MCC 



c K 



Also, C 



MCC 



where lZ ou t is described in (9). 



Transformation 5 (scaled MIMO broadcast channel A (SM- 
BCA) ->■ scaled MIMO broadcast channel (SMBQ) : The 
scaled MIMO broadcast channel (SMBC) is depicted in Figure 
2f and Figure 7. We change the received vector at the cognitive 
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Scaled MIMO Cognitive Channel B (SMCCB) Scaled MIMO Broadcast Channel A (SMBCA) 

Fig. 6. Capacity Region of SMCCB C Capacity Region of SMBCA 



receiver from 



to Y c ™. This is done by changing Tr (Qp + Qc) < P P + aP c . Let Tl% MBC1 denote the closure 



the channel matrix from the joint transmitters to the cognitive 
receiver to K = [ H CjC /\/q 1. 

Lemma 5.5 ([23]): The capacity region of the SMBCA is a 
superset of the capacity region of the scaled MIMO broadcast 
channel (SMBC). 

Proof: Let the rate pair (R p , R c ) be achievable on the SMBC. 
That is, for all ei, £2 > 0, there exists a n and a sequence of 
encoder decoder pairs at the transmitter and the two receivers 
{E n : (m p ,m c ) -> X", D% : Y p " -> m v ,D n c : Y c " -> 
rh c ) such that the codeword X" satisfies the power constraint 
of P p + aP c and the probability of decoding error is small 
(Pr(m p ^ m p ) < e x ,Pr(m c ^ m c ) < e 2 ). 

In the SMBCA, the transmitter and the receivers use the same 
coding strategy. The licensed receiver can decode message m p 
at a rate R p . The cognitive receiver can ignore Y™ and use just 
Y c to decode message m c at a rate R c . Hence, the rate pair 
(R p , R c ) is achievable in the SMBCA. Hence, the capacity 
region of the SMBCA is in general a superset of the capacity 
region of the SMBC. ■ 



We describe one more lemma whose result will be used in the 
proof of Theorem ( 13.3b . 



Lemma 5.6 ([23]): Let Csmbc denote the capacity region of 
the scaled MIMO broadcast channel described in Figure 2f. 
Then, for any fx > 1, 



sup /iR p 

(Rp,Rc)£CsMBC 



R r = inf 



sup 

(Rp,Rc)£CsMBCA 



fiR p + R c 



The proof is described in [23, Section 5.1] and is omitted here. 



We now give the proof for Theorem (13.3 



Proof of Theorem 15.51 : It was shown in [17] that Gaussian 
codebooks (i.e., codebooks generated using i.i.d. realizations 
of an appropriate Gaussian random variable) achieve the 
capacity region for the MIMO broadcast channel. In SMBC, 
let Q p denote the covariance of codeword X™ for the licensed 
user and Q c denote the covariance matrix for the cognitive 
user. The covariance matrices satisfy the joint power constraint 



(21) 



of the convex hull of the set of rate pairs described by 
(R P ,R C ) : R p > 0,R C > 
R p < log |I + G Q Q p Gt Q + G a QcGl 

-log|l + G Q Q c Gt| 
Rc < log|l + KQ c Kt| 
VQ P h 0,Q C y 
, Tr(Q p ) + Tr(Q c ) < P p + aP c 

Similarly, let TZg MBC 2 denote the closure of the convex hull 
of the set of rate pairs described by 

(R P ,R C ) : R p > Q,R C > 

R p <log|l + G Q Q p Gt Q | 

Rc < log |I + KQpKt + KQ c K t | 
-log|l + KQ p Kt| 

VQ P t. o,Q c h 0, 

Tr(Qp) + Tr(Q c ) < P p + aP c 

The capacity region of SMBC, Csmbc is the closure of the 
convex hull of TZg MBC 1 U R-smbc i- That is, 



> ■ 



(22) 



C 



SMBC 



= Cl(Co(K a SMBCA UK a s 



SMBC,2 



))• 



(23) 



R-smbc 1 denotes the portion of the capacity region of SMBC 
where the licensed user's message is encoded first. That is, the 
cognitive receiver sees no interference. Hence, for \i > 1, we 
have 



max fiRp 



R c = max [iR p 

(Rp,Rc)£CsMBC 



Rr 



Therefore, from Lemma 5.6, we have that for /i > 1, 

fj,R 



max 

(Rp,Rc)^T^s MBC 1 



R c = inf max /iR p 

S z (Rp,Ro)eC S MBCA 



Rr 



We can see that, TZp" art out described in (fTZt is a subset of 
R^SMBC 1 f° rmec l by restricting the covariance matrix Q c to 
have the form 



s c 



Qc 



It can also be seen that 71"^* described in (0 equals 
Csmbc a- Hence, it follows that for any [i > 1 and for a > 0, 
if 



max 

(R p ,R c )eK« 



fiRp + R c 



max 

(R p ,Rc)eC° 



fiRp + R c , 



9 




Power Constraint j£ 

P, + aP„ 



Licensed Receiver 

n 



Superset 

(Partly Equal) Power Constraint 



Cognitive Receiver 



Scaled MIMO Broadcast Channel A (SMBCA) 

Fig. 7. Capacity Region of SMBCA D Capacity Region of SMBC 




Licensed Receiver 

0-^ 



[ Cognitive Receiver 

Z r 

Scaled MIMO Broadcast Channel (SMBC) 



then we have that 



max uR„ + R c = inf max uR p + R c 



VI. Optimality of the Achievable Region 

In this section, we describe conditions under which the achiev- 
able region described by 7Zi n in is optimal for a portion 
of the capacity region. In particular, we show that if (R p , R c ) 
lies on the boundary of the achievable region given by lZi n , 
then (Rp, R c ) lies on the boundary of R-p artout given by (fTSt 
for some a > 0. That is, for any // > 0, 



sup /iRp 



R c = inf 

a>0 



sup 



fj-Rp + R c - 



Then there exists a* € (0,oo) such that, for any fi > 1, 
(Rp,p, Rc,n) = argmax (/?p ^ c)eKin ^iR p + R c is a point on 
the boundary of the capacity region of the MIMO cognitive 
channel if the condition cTT~3T > is satisfied for a*. 



We denote by U a ch,rate, the set 
((R p , R c ), S p , S CjP , £ c , c , Q) given by 



of all 



(Rp, R c ), S p , S c p , S c c , Q 

Rp >o,R c > o, S P h o, S c ,p h o, s c , c t o 
Rp <log|l + GS p , net Gt 



log|l + H c , p S c , c Ht <p | 



H c ,pSc,cHj. p | 



Rc < log I 



J p,net 



h CiC x c>c hJ. 



S p Q 



>- 



(24) 



The rate pair that maximizes ^iR p +R c in the achievable region 
is given by solving the optimization problem 

sup [iR p + R c (25) 

((i?, p ,fl c ),Sp,S c , p ,S c , c ,Q) 

such that ((Rp, R c ), S p , S C)P , £ C:C , Q) 

^ 7^-ach,rate 

Tr(Sp) < P p 
Tr(S C:P + S c , c ) < P c . 



We define the functions L(R p , R c , S p , S c ,p, S c c , Ai, A2) and 
5(i?p,i? c ,Sp,S CiP ,S c , c ) as follows 

L(R p , R c , S p , S C! p, S c ,c, Ai, A2) = 

/ii? p + i? c -A 1 (Tr(S p )-P p )- (26) 
A 2 (Tr(S c , p + S c . c )-P c )) 

g(R p , R c , S p , S c p , S c c ) = 

min L(i?p,i? c ,Ep,E C p,S cc , Ai, A 2 ). (27) 

Ai>0,A 2 >0 

The optimization problem given by 

max 5(i? p ,i? c ,Sp,S CiP ,S CjC ) (28) 

(H p ,i? c ,S p ,S c , p ,S c . c ,Q) 

such that 



((R p , R c ), S p , £ C , P , X cc , Q) 

£ "Rach,rate 



has the same optimum value as that of ( f25l >. This is formally 
stated in the lemma below. 

Lemma 6.1: Let M denote the optimal value of the optimiza- 
tion problem defined in (l25l l. and U denote the optimal value 
of the optimization problem defined in d28l . Then, AI = U. 

Proof : We show that for any set of covariance matrices 
(S p , S c p , S c c ) that do not satisfy the power constraints 
given by ©, g(R p , R c , S p , S C)P , S c ,c) = -00. The power 
constraints can be violated by three means : 

. Tr(Sp) > P p and Tr(S c , p ) + Tr(S c , c ) < P c : In this 
case, Ai will take an arbitrarily large value and A2 = 
to drive g(R p , R c , £ p , S C , P , S c ,c) to -00. 

. Tr(Sp) < P p and Tr(£ c , p ) + Tr(£ C)C ) > P c : In this 
case, Ai = and A2 will take an arbitrarily large value 
to drive g(R p , R c , S p , £ C , P , S c , c ) to -00. 

. Tr(Sp) > P p and Tr(£ c , P ) + Tr(S C)C ) > P c : In this 
case, Ai and A2 will take arbitrarily large values to drive 
3(i? p ,i? c ,Sp,S Ci p,S CjC ) to -00. 

When both the covariance matrices satisfy the power 
constraints with inequality, then Ai = A2 = 0. This 
is because, Tr(S p ) - P p and Tr(£ CiP + £ c ,c) - P c 
are both negative. Hence, for any positive value 
of Ai or A 2 , L(R P , R c , £ p , £ C , P , S c c , Ai, A2) > 
L(R p , R c , S p , S c p , S c c , 0, 0). 
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When one of the power constraint is satisfied with equality, say 
Tr(Sp) — P p = and the other power constraint is satisfied 
with inequality Tr(S c p + S c ,c) — Pc < 0, then, we will have 
A2 = and Ai will be some real number. In any case, we still 



have Ai(Tr(S p ) - P p ) = A 2 (Tr(S c 



Pc) = 0. 



Similarly, when the first constraint is satisfied with inequality, 
and the second constraint satisfied with equality, we have 
Ai = and A2 is some non negative real number. We have 
Ai(Tt(Ep) - P P ) = A 2 (Tr(£ CjP + E c , c ) - P c ) = 0. 



Section 2.8]. The new space a G R + U {0, 00} is compact and 
Hausdorff. 

The optimization problem after changing the constraint set on 
a becomes 

Ni = inf sup nRp + R c (33) 

aGR+U{0,oo} ((R pi R a ) t Qp.Eo.c) 

such that ((Rp, R c ), Q p , S CjC ) G Ti^ art conv rate 
Tr(S c , c ) + Tr(Qp) < aP c + P p . 



Finally, if both the power constraints are satisfied with equal- 
ity, Ai and A2 are some non-negative real numbers. And 
Ai(Tir(£p) - P P ) = A 2 (Tr(S c , p + S C)C ) - P c ) = 0. 

Hence, in all the cases, the complementary slackness condi- 
tions are satisfied. Hence, the optimal solution of the opti- 
mization problem (l28l satisfy the power constraints and the 
objective function reduces to that of optimization problem 
d25| >. Hence, both the optimization problems have the same 
optimal values. That is, M = U. ■ 

Next, we find the optimum value of /j,R p + R c over all the 
rate pairs that are in the region TZ part out described by (fT2l . 
This is done by solving the following optimization problem: 



sup 

((fi p ,_R c ),Qp,£ c 

such that 



jiRp + R c 

) 

((Rp,R c ), 
Tr(£ CjC ) - 



(29) 



p ) ^c,c) 



Q 

- Tr(Qp) <aP c + P p 



£= 7?" 

^- '^part,conv,rate 



'^"part,conv .rate 



is the 
described by 



set of quadruples 



((-Rp, R c ), Q P , £ c ,. 

((Rp, R c ), Q P , S CiC ) : 
R P >0,R C > 0, Q P h 0, S c , c h 



R p <log|l + G Q Q p Gt Q 



1 1 \ 1 

jj-n-c.p-^cc 



-log I+^Hcp^, 



h ^,pI 



i? c <log|l + iH c 



(30) 



We let the optimal solution of 

Let N = min Q , > o N(a) and 



to be denoted by N(a) 



arg min N(a) 

Q>0 



(31) 



We show in Lemma 6.2 that a* e (0, 00) exists. Then, 
iV is given by the optimum value of the following inf sup 
optimization problem 



inf 

a>0 



sup 

((flp,fl c ),Q p ,S c 



fiRp + R c 



(32) 



such that ((Rp, R c ), Q p , E c , c ) e Tl% 



rt,co7iv,rate 



Tr(S c , c ) + Tr(Qp) < aP c + P p . 

The infimum constraint a > is not a compact set. We modify 
the constraint on a to a E M + U {0,oo}. This is done to 
compactify the set by adding two extra symbols and 00. 
The point zero is added to make the set closed. The process 
of adding the point 00 is called one point compactification. 
Details on one point compactification can be found in [24, 



We show that adding the two points and 00 to the constraint 
set on a does not change the optimum value of the optimiza- 
tion problem. This result is formally stated and proved in the 
following lemma. 

Lemma 6.2: The optimum value of the optimization problem 
given by d32l . N is equal to the optimum value of the 
optimization problem described by d33l l. N\. That is, N = N±. 

Proof: For any a G R + U {0,oo}, we let h(a) to denote the 
value of the inner sup problem. That is, 



h(a) 



■ sup 

((fl p ,i? c ),Q p ,£c 



fiRp + R c 



(34) 



such that ((.Rp, R c ), Q P! S C)C ) G TZ^ art 

,conv,rate 

Tr(S c , c ) + Tr(Qp) < P p + aP c . 
We show that liminf Q ^o h(ct) = liminf Q ,_» 00 h(a) = 00. 

Letting a — > 0, we put all the power in S c . c . That is, we 
choose S p = 0, S CiP = 0, Q = and £ c , c = Pp +^ c l na t . 
Also, we take 



R v = and R c 



log 



1 P p + aP c 

a n c ,t 



It follows from 423 that ((Rp, R c ), Q p , S C;C ) G 



K 



par t,conv, rate' 



Also, Tr(Qp) + Tr(S c , c ) 



P„ 



aP r 



Hence, ((Rp, R c ), Qp, S c c ) satisfy all the necessary 
constraints of ( |34l . Substituting these particular values of 
( (Rp, .R c ), Q p , S c ,c)> we get a lower bound on h(a). That 
is, 



lim inf h(a) > lim inf log 

a— >0 a— >0 



1 Pp + aP c 
a n ct 



Hc,cHt 



(35) 



Next, we look at the situation when a — * 00. In this case, 
we put all the power in S p . That is, we choose S p = 
"■I„ , S CiP = 0, S C:C = and Q = 0. We also 



n p,t 
choose 



R c = and R p = 



P P + aP c 



Ff Ff^ 



These values of ((-Rp, R c ), Q P , S c c ) satisfy all the necessary 
constraints of d34l l. Hence, we have 



lim inf h(a) > lim inf fi log 



a — >oo 
OO. 



aP c 



FT Ff^ 



(36) 
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Hence, h(a) = oo when a = or a = oo. Also, when 
a E R + , h(a) < oo. Hence, the optimum value of ( f33T > is 
reached when a is neither nor oo. Hence, N — N\. ■ 

As Q p is the covariance matrix of the codeword X(i),t = 
1, . . . , n for the primary user, it can be written as 



S P Q 

Q f £c,p 



(37) 



It is easy to see that the set Rp art conv described in (11) can 
also be written as 

i? c ), S p , S c p , Q, S c c ^ : 

R P >0,R C > 0, S p h 0, S c , p h 0, S c , c h 
R p < log |I + GQpGt + H c , p S c , c Ht p | 
-log|l + H C)P S C)C Ht iP | 

R c < log|l + H c , c S c , c Ht c | 
Tr(Sp) + Q'Tr(S c , p ) + aTr(S c , c ) < P p + aP c J 

where G = [H p p H c p ]. This is done by transforming 
Q,S cp ,S cc into y^Q, QiS C) p, aS C)C respectively. We de- 
fine lZ P art, conv. rate as the set described by 

((Rp, R c ), S p , S C , P , Q, ^c,c) ■ 

Rp >o,r c > o, s p y o, s c , p y o, s c , c y o 



R p < log 1 1 + GQpGt + H cp S c , c H 

-log|l + Hc,pSc,cHtp| 
fl c <log|l + H CiC S c , c Htl 



t 

c,p| 



Qi 



s P Q 

Q f S 



c.p 



(39) 



Hence, the optimization problem d33l can be written as 

N = inf sup fiRp + R c 

aSK+U{0,oo} ((fl pi fl ),5: pi E p ,Q,S c c ) 

(40) 

SUCh that ((Rp, Rc), ^p, ^c,p 5 Qi S c c ) G 'R-part,conv,rate 

Tr(Sp) + aTr(S c ,p) + aTr(S c , c ) < P p + aP c . 

We state the following lemma for switching min and max 
in minimax problems. The lemma is described and proved in 
Theorem 2 in [25]. 

Lemma 6.3: (Ky-Fan's minimax switching theorem [25, Thm. 
2]) Let X be a compact Hausdorff space and Y an arbitrary set 
(not topologized). Let / be a real-valued function on X x Y 
such that, for every y G Y, f(x,y) is lower semi continuous 
on X. If / is convex on X and concave on Y, then 

inf supf(x,y) = sup inf f(x,y)B ( 41 ) 

We see that the objective function fiR p + R c is concave 
with respect to the maximizing variables ((Rp, R c , Q P , S c c ) 
and convex with respect to the minimizing variable a. The 

2 In (49), the inf can be replaced with min, but we use inf throughout to 
maintain continuity and to avoid confusion. 



constraint space a £ R + U {0, oo} is compact and Hausdorff 
[24, Section 2.8]. 

Hence, all the conditions of the lemma are satisfied. Hence, 
by Ky-Fan's mini-max switching theorem [25], we can inter- 
change the sup and inf without affecting the optimum value. 
Hence, 

N = sup inf [iRp + R c 

((fi p ,fl c ).S p ,S c p,Q,S c c) aeK+U{0,oo} 

(42) 

such that ((Rp, R c ), S p , S c ,p, Q, £ c , c ) e TZ P art,conv ,rate 
Tr(Sp) + aTr(S c , p ) + aTr(£ c , c ) < P p + aP c . 

Similar to the functions L and g defined in d26*b and ( l27l ). 
we define the functions Li(R p , R c , S p , S c p , S c c , A, a) and 
5i(i?p,i? c ,Sp,S CiP ,S c ^,Q;) as follows 

L\(R P , R c , S p , S CiP , S c c , A, a) = fiR p + R c — 

A (^Tr(£p) + aTr(S c , p ) + aTr(S c , c ) - P p - aP c j , (43) 

gi(R P , R c , S p , S C , P , S c , c , a) = 

inf L x (R p , R c , S p , S c , p , S c , c , A, a). (44) 

We define the following optimization problem 

V= sup infgi(i? p ,i? c ,Sp,S CiP ,S CiC ,a) 

(45) 

such that ((Rp, R c ), S p , S C;P , Q, S CjC ) € TZ pa rt ,conv ,rate 
Q£l + U {0,oo}. 

Lemma 6.4: The optimum value of optimization problem 
d42l) . N is equal to the optimum value of the optimization 
problem (l45b . V. 

Proof : The proof of the lemma is along the same lines 
as the proof of Lemma 16.11 We show that for any set of 
covariance matrices S p , S c p and S c c that do not satisfy 
the power constraint Tr(S p ) + aTr(S c p ) + aTr(S c c ) < 
P p + aP c , 5ri(i?p,i? c ,S p ,S C;P ,S CiC ,a) = -oo. This is 
because, Tr(S p ) + aTr(S c , p ) + aTr(S CjC ) - P p - aP c is 
positive, and hence, A will take an arbitrarily high value to 
drive gi(R p , R c , S p , S C , P , S c , c , a) to — oo. Hence, the outer 
supremization problem will ensure that the power constraint 
is satisfied. 

Moreover, when the power constraints are satisfied 
with inequality, then Tr(S p ) + aTr(S c , p ) + 
aTr(S cc ) — P p — aP c is negative. Therefore, for any 
A > 0, we have Li(R p , R c , S p , S CiP , S C:C , A, a) > 
Li(R p , R c , S p , S c p , S c c , 0, a). Hence, A will take the value 
zero. When the power constraint is satisfied with equality, 
then Tr(Sp) + aTr(S c , p ) + aTr(S c , c ) - P p - aP c = 0. 
Then, A will take some non negative real number. Hence, 
the complementary slackness condition is satisfied. Hence, 
the optimal solution of the optimization problem satisfy the 
power constraint and the objective function reduces to that of 
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( l42l >. It follows that, the optimum value of the optimization 
problem (l42"T i. N is the same as the optimum value of the 
optimization problem d45l l, V. ■ 

Next, we show that the optimum value of the optimization 
problem (1281) . U is an upper bound on the optimal value of 
the optimization problem d45b . V. 



Lemma 6.5: The optimal value of ( 1281 . U is an upper bound 
on the optimal value of (l42l . V. 

Proof: Both the optimization problems are sup min problems. 
For any Ai > and A2 > 0, we can choose A = Ai and 
a = A2/A1 so that L 1 (R p , R c , £ p , S C , P , £ c ,c, A, a) = 
L(Rp,R c , S p ,S Cj p,S CiC , Ai,A 2 ). Hence, for any 
((R p , Rc), 5j p , S CjP , S c c ), 

inf Li(R p , R c , S p , £ c p , S c .c, A, a) < 

A>0,aGR+U{0,oo} 

inf L(J2p,i2c,£p,£c lP ,Ecc,Ai,A2). ( 46 > 

AISO, TZpart,conv,rate = 'R-ach.rate- Hence, it follows that V < 

u. m 



We can now prove Theorem 13.41 



Proof of Theorem 13.41 : Let p, > 1, The proof of the theorem 
follows directly from Lemmas |6. II 16.41 and 16.51 From Lemma 
16.11 we have that the optimum value of the optimization 
problem d25l l. M equals the optimum value of optimization 
problem d28l i, U. From Lemma \6A\ we have that the optimum 
value of optimization problem (PEL N equals the optimum 
value of the optimization problem d45l , V. M is the solution 
of the optimum pR p + R c over the achievable region and N is 
the solution of the optimum pR p +R c over TZ" t out described 
in ( TTSb . Hence if the condition given by (fL3b is satisfied for 
a* given by OTb . M < N. From Lemma 16.51 we also have 
V < U. Hence, we have that the optimal value of the original 
optimization problem (l25l l. M is equal to the optimal value 
of the optimization problem described by d42l , N. Hence, the 
achievable region lZi„ is /i-sum optimal. ■ 



VII. Numerical Results 

In this section, we provide some numerical results on the 
capacity region of the MIMO cognitive channel. We consider 
a MIMO cognitive system where the licensed and cognitive 
transmitters have one antenna each, and the licensed and 
cognitive receivers have one and two antennas respectively. We 
assume that the channel coefficients are real and also restrict 
ourself to real inputs and outputs. We generate the channel 
values randomly 



Hp, p = 1.4435, Hp. c 



H CiP = 0.799, H c ,, 



-0.3510 
0.6232 

0.9409 
-0.9921 



We assume a power constraint of 5 at the licensed and 
cognitive transmitters. In Figure 8, we plot the achievable 




Fig. 8. Plot of Achievable Region 1Zi n and partial outer bounds 7Zp art out 
for different values of a. 



Figure 8 shows how TZp art out intersects with TZi 
points for different values of a. 



at different 



Next, we find the maximum value of rate than can be supported 
by the licensed user in the example we considered. In both 
the achievable region and the outer bound, this corresponds to 
maximizing the /i-sum /j,R p + R c when \i — > 00. This would 
correspond to using all the power to support the licensed user. 
Note that the maximum value of R v in the set described by 
lZp art out is an upper bound on the maximum value of R p in 
the set lZi n for all values of a > 0, irrespective of the channel 
parameters. 

Maximizing R p over lZi n : The cognitive transmitter uses all 
its power for helping the licensed user. That is Tr(S c p ) = P c . 
This then reduces to a MIMO channel with channel matrix 
given by G = [ H p , p H c p ] . The licensed transmitter has 
a power constraint of P p and the cognitive transmitter has a 
power constraint of P c . Applying this to our example channel, 
we have G = [ 1.4435 0.799 1. The optimum covariance 
matrix is of the form 



J p,net 



5 bp 
5p 5 



region, lZi n and the region 1Z" 



part. out 



for different values of a. 



where p is the correlation between the two transmitters. 
Therefore, the rate achieved by the licensed user is 

R P (p) = ilog(l + GS p , net Gt). 

The maximum rate is attained at p = 1 and the maximum 
value of R p is 2.3542. 

Maximizing R p over 7Zp art out : For a given a, this 
reduces to a single user MIMO channel with G a = 
[ Hp P H CJ / \/a ] and a sum power constraint of P p + 
aP c . Note that, there is a significant difference between the 
two single user MIMO channels. The MIMO channel that 
we considered when solving the maximum value of R p in 
the achievable region had individual power constraints at 
the licensed and cognitive transmitters. However, the MIMO 
channel we obtain when solving for the maximum value 
of R p over Hp art out has a sum power constraint. This is 
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a conventional MIMO channel and the optimum covariance 
matrix is obtained by water-filling. For a given a, the best R p 
is got by 

maxima) = \ log |I + G a S Pinet G a | 
such that Tr(S p , ne t) < P p + aP c . 

It is easy to solve this problem if we look at the flipped channel 
Gjj. The capacity of the flipped channel is given by 

R p (a) = ilog|l + G^(P p + aP c )G a | 

= ilog(l + (P p + a P c )G Q Gt). 

Note that R p (a) is an outer bound on the maximum value 
of R p . The best upper bound is got by minimizing over all 
possible values of a. The optimum value of a is got by solving 
a cubic equation 2(0.799)V + (0.799) 2 a 2 - 1.4435 2 = 0, 
and its approximate value is 0.9689. 

VIII. Conclusions 

In this paper, we derived an achievable region, lZi n given by 
© and an outer bound, TZ"^ z given by © for the MIMO 
cognitive channel. We describe conditions when the achievable 
region is ^-sum optimal for any /i > 1. In particular, for any 
H>\, there exists a* € (0,oo), such that if the region given 
by 7Z part out optimizes the \i— sum rate of the SMBC (for that 
particular a*), then the achievable region achieves the {i-sum 
capacity of the MCC. 
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